CDS

Accession Number TCMCG074C14078
gbkey CDS
Protein Id KAF8395685.1
Location join(30647894..30648124,30649773..30649851,30650093..30650171,30652482..30652591,30664755..30664847,30668642..30668881,30677203..30677349,30681549..30681637,30682567..30682801,30687856..30688089,30688168..30688421)
Organism Tetracentron sinense
locus_tag HHK36_019635

Protein

Length 596aa
Molecule type protein
Topology linear
Data_file_division PLN
dblink BioProject:PRJNA625382, BioSample:SAMN14615867
db_source JABCRI010000013.1
Definition hypothetical protein HHK36_019635 [Tetracentron sinense]
Locus_tag HHK36_019635

EGGNOG-MAPPER Annotation

COG_category G
Description Belongs to the glycosyl hydrolase 2 family
KEGG_TC -
KEGG_Module -
KEGG_Reaction R01105        [VIEW IN KEGG]
R01678        [VIEW IN KEGG]
R03355        [VIEW IN KEGG]
R04783        [VIEW IN KEGG]
R06114        [VIEW IN KEGG]
KEGG_rclass RC00049        [VIEW IN KEGG]
RC00452        [VIEW IN KEGG]
BRITE ko00000        [VIEW IN KEGG]
ko00001        [VIEW IN KEGG]
ko01000        [VIEW IN KEGG]
KEGG_ko ko:K01190        [VIEW IN KEGG]
EC 3.2.1.23        [VIEW IN KEGG]        [VIEW IN INGREDIENT]
KEGG_Pathway ko00052        [VIEW IN KEGG]
ko00511        [VIEW IN KEGG]
ko00600        [VIEW IN KEGG]
ko01100        [VIEW IN KEGG]
map00052        [VIEW IN KEGG]
map00511        [VIEW IN KEGG]
map00600        [VIEW IN KEGG]
map01100        [VIEW IN KEGG]
GOs -

Sequence

CDS:  
ATGAGTGTCGACGCGTGTTGTCCAGATCGTTGTAAATTGCAACCCTTTGCCTCCAATTCAGATCTTCTATGTTCCAGAAATCTTAGCTGCAACCTTGGTAATTCAGCCCCAAACAGAATCTTCTATGTTCCAGAAATCTTAGCTGCAACCGTGGTAATTCATCCCCAAGCAGATGTTGCATGCTGCATACATGCCCTCGCCGGGCTATTGGTGAATACTTATAACTGGACGTTATTGCTTTGTCTATTGAAGTTCTTTAATGGAATATTTGAAACTTTTGCTCCTGCTATATGGATTTATGCCTTTGTAGAACCATCTTGTGGATACACAAAAGCATCTAATCATGAAGATGGGGGTGGGAGAGCGGAGAAGATGCATGCCACATTAAGGTTAGCTGGCTTTCCTGCTTTGAGAGGGAAGTTAGAGGATGGCTTTGCTTCCTCGCATTTTGAACGCTGCACGATCGTAGGCTTTTGTGGCTTCAATGGAGGTGTCAAAGATCATGGTAGTAGAGTCTGGGAAGATCCATCTTTCATTAAATGGAGAAAGAGAGATGCTCATGTTACATTGCATTGCCATGATACAGTTGAAGGATCTCTTAAATTCTGGTATGAACGCAATAAAGTGGACTTTCTGGTATCTAGTTCAGCAGTTTGGAATGATGATGCTGTTCATGGAGCTCTTGATAGTGCTGGTTTCTGGGTCAAGGGCTTGCCTTTTGTTAAGTCCTTGTCTGGCTATTGGAAATTTTTCTTGGCCCCTAGTCCTTCAAGTGTCCCTATGAAATTTTATGATTGTGCATTTGAGGACTCCATGTGGGAAACTTTGCCAGTTCCTTCCAATTGGCAGATGCATGGTTTTGATCGTCCGATTTATACAAACGTTACATATCCATTTCCATTTGATCCACCATATGTGACTACGGACAACCCTACTGGTTGTTACAGGACATGCTTTTATATCCCTACAGAATGGACAGCTGCGCTAGTTGGTATCAGAGCTTTGGAGGCTGTTGTGTTGAAGATCTTCTCCATTTGTTCATGGCTCCTAGACGTGGACGAGGATGAGTATGGGGATCCTGGTCATAATTCGACAAAATGTAGGAAGAATTCGAGCCGTGAGGGCAAGCACTTCTTGATTGGGGAAGGAGAAGATGTGGAGGAAGATGATCTAGATGATGGAGCTGAATGTGATGTGGAGGAAGATGATCTAGATGATGGAGCTGAATGTGATGCGTATGAAGATGATAATGAAGGAGCTATAATACATGGCGATTTTGGTGAAGCACTAGTTCTCCGAAAGAGTGACAATCCTAAACAATGGGATCAGACTCTCTCCCAAGCTGAATTTGCTTTTAATAGGTCCAAGAATCGAACCACTCAATACAGCCCTTTTGAAATTGTGTATGGGCAGAATCCAAATGGTGTTCTTAACTTAGCTCCAATTCCAAACTTGGGAAAGATAAGTGGGAAGGCCGAAGACTTGGGTGAGCACATTAAATCCATTCATGAGCAGGTTCGTCAACAAATTGAAGCTATATTGACAAAGGACCGGTTTCCAGTAAGGGAGTACAACAAGCTTAGTGAAATGAAGATTGGACCATGTGAAGTGATCGCAAAGATCAATGTTAAACACCTAAGTCCATACCTTGGAGACACTTCAGGAGATGAGCTCAAAGGAAACTCGAGGTTGAGTTTTCTTACACATGGGGAGACTGATGCAGCCCTAATAGCCACTGATTTCCTTGTCAAGAGGGATCGAATGAGGCGACCAAGAATGCGGAGGACATGA
Protein:  
MSVDACCPDRCKLQPFASNSDLLCSRNLSCNLGNSAPNRIFYVPEILAATVVIHPQADVACCIHALAGLLVNTYNWTLLLCLLKFFNGIFETFAPAIWIYAFVEPSCGYTKASNHEDGGGRAEKMHATLRLAGFPALRGKLEDGFASSHFERCTIVGFCGFNGGVKDHGSRVWEDPSFIKWRKRDAHVTLHCHDTVEGSLKFWYERNKVDFLVSSSAVWNDDAVHGALDSAGFWVKGLPFVKSLSGYWKFFLAPSPSSVPMKFYDCAFEDSMWETLPVPSNWQMHGFDRPIYTNVTYPFPFDPPYVTTDNPTGCYRTCFYIPTEWTAALVGIRALEAVVLKIFSICSWLLDVDEDEYGDPGHNSTKCRKNSSREGKHFLIGEGEDVEEDDLDDGAECDVEEDDLDDGAECDAYEDDNEGAIIHGDFGEALVLRKSDNPKQWDQTLSQAEFAFNRSKNRTTQYSPFEIVYGQNPNGVLNLAPIPNLGKISGKAEDLGEHIKSIHEQVRQQIEAILTKDRFPVREYNKLSEMKIGPCEVIAKINVKHLSPYLGDTSGDELKGNSRLSFLTHGETDAALIATDFLVKRDRMRRPRMRRT